User-directed Exploration of Mining Space with Multiple Attributes

نویسندگان

  • Chang-Shing Perng
  • Haixun Wang
  • Sheng Ma
  • Joseph L. Hellerstein
چکیده

There has been a growing interest in mining frequent itemsets in relational data with multiple attributes. A key step in this approach is to select a set of attributes that group data into transactions and a separate set of attributes that labels data into items. Unsupervised and unrestricted mining, however, is stymied by the combinatorial complexity and the quantity of patterns as the number of attributes grows. In this paper, we focus on leveraging the semantics of the underlying data for mining frequent itemsets. For instance, there are usually taxonomies in the data schema and functional dependencies among the attributes. Domain knowledge and user preferences often have the potential to significantly reduce the exponentially growing mining space. These observations motivate the design of a userdirected data mining framework that allows such domain knowledge to guide the mining process and control the mining strategy. We show examples of tremendous reduction in computation by using domain knowledge in mining relational data with multiple attributes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Amplitude versus Offset (AVO) Technique for Light Hydrocarbon Exploration: A Case Study

AVO as a known methodology is used to identify fluid type and reservoir lithology in subsurface exploration. Method discussed in this paper, consists of three stages, including: Direct modeling, Inverse modeling and Cross plot interpretation. By direct modeling we can clarify lithology or fluid dependent attributes. Analysis performed using both P-P and P-Sv attributes. Inverse modeling deals w...

متن کامل

Towards a Framework for Semantic Exploration of Frequent Patterns

Mining frequent patterns is an essential task in discovering hidden correlations in datasets. Although frequent patterns unveil valuable information, there are some challenges which limits their usability. First, the number of possible patterns is often very large which hinders their effective exploration. Second, patterns with many items are hard to read and the analyst may be unable to unders...

متن کامل

Geo-visualization Support for Multidimensional Clustering

In this paper we consider how multidimensional clustering can be complemented by interactive visualization. We propose a link between geovisualization and data mining systems for supporting an iterative analysis cycle, including data pre-processing and visual exploration, automatic detection of clusters in multidimensional space of user-selected attributes, and visual analysis of cluster analys...

متن کامل

Interactive Visualization of the Market Graph

Financial markets are a fruitful area for data exploration, but the overwhelming size and dimension of the datasets usually prohibit meaningful analysis, especially on a large scale. Thus, there is a need for effective visualization tools to assist in efficiently exploring the data space. In this paper, we present a novel visualization tool that empowers a user with an interactive tool for find...

متن کامل

Connecting Segments for Visual Data Exploration and Interactive Mining of Decision Rules

Visualization has become an essential support throughout the KDD process in order to extract hidden information from huge amount of data. Visual data exploration techniques provide the user with graphic views or metaphors that represent potential patterns and data relationships. However, an only image does not always convey high–dimensional data properties successfully. From such data sets, vis...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002